# Local Window Attention
| Model | License | Description | Task | Library | Publisher | Downloads | Likes |
|---|---|---|---|---|---|---|---|
| Swinv2 Base Patch4 Window12 192 22k | Apache-2.0 | Swin Transformer v2 is a vision Transformer that achieves efficient image processing through hierarchical feature maps and local window self-attention. | Image Classification | Transformers | microsoft | 8,603 | 3 |
| Swin Large Patch4 Window12 384 | Apache-2.0 | Swin Transformer is a hierarchical vision Transformer based on shifted windows, designed for image classification. | Image Classification | Transformers | microsoft | 22.77k | 1 |
| Swin Base Patch4 Window7 224 In22k | Apache-2.0 | A hierarchical window-based vision Transformer pretrained on the ImageNet-21k dataset, suitable for image classification. | Image Classification | Transformers | microsoft | 13.30k | 15 |
| Swin Small Patch4 Window7 224 | Apache-2.0 | A hierarchical window-based vision Transformer for image classification, with computational complexity linear in input image size. | Image Classification | Transformers | microsoft | 2,028 | 1 |
| Swin Large Patch4 Window12 384 In22k | Apache-2.0 | A hierarchical window-based vision Transformer pretrained on the ImageNet-21k dataset, suitable for image classification. | Image Classification | Transformers | microsoft | 1,063 | 7 |
| Swin Tiny Patch4 Window7 224 | Apache-2.0 | A hierarchical vision Transformer that achieves linear computational complexity by computing self-attention within local windows, suitable for image classification. | Image Classification | Transformers | microsoft | 98.00k | 42 |
| Swin Large Patch4 Window7 224 | Apache-2.0 | A hierarchical vision Transformer that achieves linear computational complexity by computing self-attention within local windows, suitable for image classification and dense recognition tasks. | Image Classification | Transformers | microsoft | 2,079 | 1 |
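The model descriptions above all point to the same core idea: self-attention is computed inside small, non-overlapping windows rather than across the whole image, so cost grows linearly with image size instead of quadratically. A minimal NumPy sketch of that mechanism is below. It is illustrative only: it uses a single head with Q = K = V = x, and omits the learned projections, multi-head splitting, shifted windows, and relative position bias that the actual Swin blocks add. The function names (`window_partition`, `window_attention`) are this sketch's own, not the library's API.

```python
import numpy as np

def window_partition(x, window_size):
    """Split a (H, W, C) feature map into non-overlapping windows.

    Returns an array of shape (num_windows, window_size*window_size, C),
    i.e. each window becomes an independent sequence of tokens.
    """
    H, W, C = x.shape
    x = x.reshape(H // window_size, window_size, W // window_size, window_size, C)
    return x.transpose(0, 2, 1, 3, 4).reshape(-1, window_size * window_size, C)

def softmax(a, axis=-1):
    # Numerically stable softmax over the given axis.
    a = a - a.max(axis=axis, keepdims=True)
    e = np.exp(a)
    return e / e.sum(axis=axis, keepdims=True)

def window_attention(x, window_size):
    """Single-head self-attention computed independently inside each window.

    With Q = K = V = x (no learned weights, for brevity), each window of
    ws*ws tokens attends only to itself, so total cost is linear in the
    number of windows and hence in image area.
    """
    windows = window_partition(x, window_size)                   # (nW, ws*ws, C)
    C = windows.shape[-1]
    scores = windows @ windows.transpose(0, 2, 1) / np.sqrt(C)   # (nW, ws*ws, ws*ws)
    return softmax(scores) @ windows                             # (nW, ws*ws, C)

# A 56x56 feature map with 96 channels and 7x7 windows (Swin-Tiny stage-1 sizes):
feat = np.random.rand(56, 56, 96).astype(np.float32)
out = window_attention(feat, window_size=7)
print(out.shape)  # (64, 49, 96): 8x8 windows of 49 tokens each
```

Note that each 49x49 attention matrix is tiny compared to the 3136x3136 matrix global attention would require on the same map, which is the efficiency argument the model cards repeat.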